Open In Colab

PoincaréMSA: Poincaré maps for visualization of large protein famillies

PoincareMSA builds a projection of protein multiple sequence alignemnt (MSA) on a Poincaré disk. The proximity of the points to the disk center corresponds to their hierarchy and correlates with the proximity of the proteins to the root of the phylogenetic tree. Thus, must central point often correspond to the ancestor proteins and protein located close to the border to the leaves of phylogenetic tree.

Notebook initialization

Load dependencies

Data import

Settings

Data preparation and data projection using Poincaré disk

Projection visualization

 Save plot to file